Genetic Algorithm Wrappers For Feature Subset Selection In Supervised Inductive Learning

نویسندگان

  • William H. Hsu
  • Cecil P. Schmidt
  • James A. Louis
چکیده

1. Inferential loss: Quality of the model produced by an inducer as detected through inferential loss evaluated over a holdout validation data set Dval ≡ D \ Dtrain 2. Model loss: “Size” of the model under a specified coding or representation 3. Ordering loss: Inference/classificationindependent and model-independent measure of data quality given only training and validation data D and hyperparameters ÿ

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Control of Inductive Bias in Supervised Learning using Evolutionary Computation: A Wrapper-Based Approach

In this chapter, I discuss the problem of feature subset selection for supervised inductive learning approaches to knowledge discovery in databases (KDD), and examine this and related problems in the context of controlling inductive bias. I survey several combinatorial search and optimization approaches to this problem, focusing on datadriven validation-based techniques. In particular, I presen...

متن کامل

Genetic Algorithms for Reformulation of Large-Scale KDD Problems with Many Irrelevant Attributes

The goal of this research is to apply genetic implementations of algorithms for selection, partitioning, and synthesis of attributes in largescale data mining problems. Domain knowledge about these operators has been shown to reduce the number of fitness evaluations for candidate attributes. We report results on genetic optimization of attribute selection problems and current work on attribute ...

متن کامل

Genetic Wrappers for Constructive Induction in High-Performance Data Mining

We present an application of genetic algorithmbased design to configuration of high-level optimization systems, or wrappers, for relevance determination and constructive induction. Our system combines genetic wrappers with elicited knowledge on attribute relevance and synthesis. We discuss decision support issues in a largescale commercial data mining project (cost prediction for multiple autom...

متن کامل

A Parallel Genetic Algorithm Based Method for Feature Subset Selection in Intrusion Detection Systems

Intrusion detection systems are designed to provide security in computer networks, so that if the attacker crosses other security devices, they can detect and prevent the attack process. One of the most essential challenges in designing these systems is the so called curse of dimensionality. Therefore, in order to obtain satisfactory performance in these systems we have to take advantage of app...

متن کامل

A Parallel Genetic Algorithm Based Method for Feature Subset Selection in Intrusion Detection Systems

Intrusion detection systems are designed to provide security in computer networks, so that if the attacker crosses other security devices, they can detect and prevent the attack process. One of the most essential challenges in designing these systems is the so called curse of dimensionality. Therefore, in order to obtain satisfactory performance in these systems we have to take advantage of app...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2002